SWASH: A Naive Bayes Classifier for Tweet Sentiment Identification
نویسندگان
چکیده
This paper describes a sentiment classification system designed for SemEval-2015, Task 10, Subtask B. The system employs a constrained, supervised text categorization approach. Firstly, since thorough preprocessing of tweet data was shown to be effective in previous SemEval sentiment classification tasks, various preprocessessing steps were introduced to enhance the quality of lexical information. Secondly, a Naive Bayes classifier is used to detect tweet sentiment. The classifier is trained only on the training data provided by the task organizers. The system makes use of external human-generated lists of positive and negative words at several steps throughout classification. The system produced an overall F-score of 59.26 on the official test set.
منابع مشابه
GTI at SemEval-2016 Task 4: Training a Naive Bayes Classifier using Features of an Unsupervised System
This paper presents the approach of the GTI Research Group to SemEval-2016 task 4 on Sentiment Analysis in Twitter, or more specifically, subtasks A (Message Polarity Classification), B (Tweet classification according to a two-point scale) and D (Tweet quantification according to a two-point scale). We followed a supervised approach based on the extraction of features by a dependency parsing-ba...
متن کاملAn Empirical Study on Machine Learning for Tweet Sentiment Analysis
Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. As the most widely used approach for tweet sentiment analysis, machine learning algorithms work well on the sentiment classification, just as they have been successfully applied for many other purposes. In this thesis, we conduct a systematic and thorough empirical study on the machine learni...
متن کاملI act , therefore I judge : Network sentiment dynamics based on user activity change Supplemental Material
We annotate individual posts following the approach in [1], [2]. Tweets are grouped by topic based on included topic hashtags. For example, tweets relating to the topic of president Barack Obama contain the hashtag #obama within them. The topic-related tweets often contain other hashtags which we assign a preliminary sentiment probability (positive and negative) using the Multinomial Naive Baye...
متن کاملSentiment Analysis for Social Media
The proposed system is able to collect useful information from the twitter website and efficiently perform sentiment analysis of tweets regarding the Smart phone war. The system uses efficient scoring system for predicting the user’s age. The user ‘gender is predicted using a well trained Naïve Bayes Classifier. Sentiment Classifier Model labels the tweet with a sentiment. This helps in compreh...
متن کاملTwitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination
This paper presents a step-by-step methodology for Twitter sentiment analysis. Two approaches are tested to measure variations in the public opinion about retail brands. The first, a lexicon-based method, uses a dictionary of words with assigned to them semantic scores to calculate a final polarity of a tweet, and incorporates part of speech tagging. The second, machine learning approach, tackl...
متن کامل